Multimodal Indexing of Multilingual News Video
Abstract
The problems associated with automatic analysis of news telecasts are more severe in a country like India, where there are many national and regional language channels in addition to English ones. In this paper, we present a framework for multimodal analysis of multilingual news telecasts that can be augmented with tools and techniques for specific news analytics tasks. We focus on a set of techniques for automatically indexing news stories based on keywords of contemporary and domain interest, spotted both in speech and in the visuals. English keywords are derived from RSS feeds and converted to their Indian-language equivalents for detection in speech and in ticker text. Restricting the keyword list to a manageable number results in a drastic improvement in indexing performance. We present illustrative examples and detailed experimental results to substantiate our claim.
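As a rough illustration of the keyword-driven indexing idea described in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes the speech recognizer and ticker-text reader have already produced plain text for each story. The sample feed, the stopword list, the translation table, and the function names `keywords_from_rss` and `index_stories` are all hypothetical.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter, defaultdict

# Hypothetical RSS snippet; a real system would fetch a live feed.
SAMPLE_RSS = """<rss version="2.0"><channel>
<item><title>Election results announced in Delhi</title></item>
<item><title>Cricket team wins series after Delhi match</title></item>
<item><title>Budget session: election promises debated</title></item>
</channel></rss>"""

# Tiny illustrative stopword list (assumption).
STOPWORDS = {"in", "the", "after", "of", "a", "and"}

# Hypothetical English-to-Hindi keyword table (assumption).
TRANSLATIONS = {"hi": {"election": "चुनाव", "delhi": "दिल्ली"}}


def keywords_from_rss(rss_xml, max_keywords):
    """Return the most frequent non-stopword terms found in RSS item titles."""
    root = ET.fromstring(rss_xml)
    counts = Counter()
    for title in root.iter("title"):
        for word in re.findall(r"[a-z]+", (title.text or "").lower()):
            if word not in STOPWORDS:
                counts[word] += 1
    # Restricting the list keeps the spotting vocabulary manageable.
    return [word for word, _ in counts.most_common(max_keywords)]


def index_stories(stories, keywords, translations):
    """Map each English keyword to the set of story ids where it was spotted.

    `stories` holds (story_id, language_code, text) tuples, where the text
    stands in for recognized speech or ticker text.
    """
    index = defaultdict(set)
    for story_id, lang, text in stories:
        lowered = text.lower()
        for kw in keywords:
            # Use the English form directly, else its translated equivalent.
            surface = kw if lang == "en" else translations.get(lang, {}).get(kw)
            if surface and surface in lowered:
                index[kw].add(story_id)
    return dict(index)


keywords = keywords_from_rss(SAMPLE_RSS, max_keywords=2)
stories = [
    ("s1", "en", "Election commission meets today"),
    ("s2", "hi", "दिल्ली में चुनाव की तैयारी"),
    ("s3", "en", "Weather update"),
]
story_index = index_stories(stories, keywords, TRANSLATIONS)
```

Here the index is keyed by the English keyword, so stories in different languages about the same topic land under one entry. Plain substring matching is a stand-in for the actual spotting in audio and video, which the abstract does not detail.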
Similar resources
Feature Selection for Trainable Multilingual Broadcast News Segmentation
Indexing and retrieving broadcast news stories within a large collection requires automatic detection of story boundaries. This video news story segmentation can use a wide range of audio, language, video, and image features. In this paper, we investigate the correlation between automatically-derived multimodal features and story boundaries in seven different broadcast news sources in three lan...
Retrieving Video Segments Based on Combined Text, Speech and Image Processing
This paper describes a multimedia, multilingual and multimodal research system (CIMWOS) supporting content-based indexing, archiving, retrieval and on-demand delivery of audiovisual content. Several projects aim to develop advanced technologies and systems to tackle the problems encountered in multimedia archiving and indexing [8], [9], [10]. CIMWOS [1] (Combined IMage and WOrd ...
CIMWOS: A Multimedia Archiving and Indexing System
This paper describes a multimedia, multilingual and multimodal research system called CIMWOS (Combined IMage and WOrd Spotting). CIMWOS incorporates an extensive set of multimedia technologies, integrating three major subsystems (text, speech, and image processing). It produces a rich collection of XML metadata annotations following the MPEG-7 standard. These XML annotations are further merged ...
The CIMWOS Multimedia Indexing System
We describe a multimedia, multilingual and multimodal research system (CIMWOS) supporting content-based indexing, archiving, retrieval and on-demand delivery of audiovisual content. CIMWOS (Combined IMage and WOrd Spotting) incorporates an extensive set of multimedia technologies by seamless integration of three major components – speech, text and image processing – producing a rich collection ...
Multilingual Multimodal Language Processing Using Neural Networks
We live in an increasingly multilingual multimodal world where it is common to find multiple views of the same entity across modalities and languages. For example, news articles which get published in multiple languages are essentially different views of the same entity. Similarly, video, audio and multilingual subtitles are multiple views of the same movie clip. Given the proliferation of such...
Journal title:
- Int. J. Digital Multimedia Broadcasting
Volume: 2010
Publication date: 2010